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Abstract 

We prove that there exists a rearrangement of the first N elements of the trigonometric 
system such that the L 2 -norm of the square variation operator is at most O e (log 9 / 22+e (iV)). 
This is an improvement over 0(log 1 ^ 2 (iV)) from the canonical ordering. 

1 Introduction 

Let <3? := {<fin}n=i denote an orthonormal system (ONS) of functions from a probability space, 
T, to R. One is often interested, usually motivated by questions regarding almost everywhere 
convergence, in the behavior of the maximal partial sum operator 



M. fix) := max 
J y ' £<N 



^a n (j) n (x) 



n=l 



For an arbitrary ONS, the Rademacher-Menshov theorem states that ||A^/||i,2 <C log(A r )||/|| L 2, 
where the log(iV) factor is known to be sharp. However, one can do much better for many 
classical systems, for instance the well-known Carleson-Hunt inequality allows one to replace 
log(iV) with an absolute constant in the case of the trigonometric system. More recently, there 
has been interest in variational refinements of these maximal results. We define the r-variation 
operator 

/ r\ 1/r 



V r fix) := max > 



where Vn denotes the set of partitions of [N] into subintervals. Clearly, V r f is pointwise non- 
decreasing as r decreases, and Mf < V r f for all r < oo. While estimates involving the maximal 
operator typically imply statements regarding almost everywhere convergence, estimates involv- 
ing the larger variational operators typically imply quantitative statements about the rate of 
convergence. In [10], it was shown that the Rademacher-Menshov theorem can be strengthened 

to||V 2 /llL 2 «log(iV)||/||L 2 - 

In the case of the trigonometric system, it was shown by Oberlin, Seeger, Tao, Thiele, and 
Wright [13] that ||V r /||2 < r H/lb for r > 2 (strengthening the Carleson-Hunt inequality). In 
the case r = 2, one can deduce the inequality ||V 2 /||2 -C ylog^iV) 1 1 / 1 1 2 from the Carleson-Hunt 
inequality (see [TO], Theorem 3). Moreover, the y / log(A r ) can be shown to be sharp (see |13j . 
section 2). 

When considering questions regarding partial sums of an ONS, the ordering of the system 
plays a crucial role. For instance, Olevskii [14j has shown that any infinite complete ONS can be 
reordered in a manner such that almost everywhere convergence fails for some L 2 function. The 
related question of whether an ONS can be reordered in a manner such that almost everywhere 
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convergence holds for every I? function is known as Kolmogorov's rearrangement problem and 
is one of the central open problems regarding orthonormal systems. Going further, Garsia has 
conjectured [B] (see also [3]) that given an ONS <3? := {(/>n}n=n one ma y a permutation 
<r(n) : [N] —¥ [N] such that the reordered system {^> a (n)}n=i satisfies ||A^/||2 <C H/lb where 
the implied constant is absolute and independent of the system. This is known to imply an 
affirmative solution to Kolmogorov's problem. As partial progress towards Garsia's conjecture, 
Bourgain [3] has shown (for uniformly bounded systems) that one may always find a permutation 
of [N] such that ||M/|| 2 < log log(iV) 1 1/| | 2 - In [TO], this was strengthened to ||V r /l|a <r 
loglog(A r )||/||2 for r > 2. While these estimates have strong consequences for very general 
orthonormal systems, their conclusions are weaker than what is known to be true for most 
classical systems (such as the trigonometric system) in their canonical orderings. 

With this in mind, it was asked in [TO] if one could improve the inequality ||V 2 /||2 <C 
Y / log(A r )||/|| 2 by reordering the first ./V elements of the trigonometric system. Here we provide 
an affirmative answer to this question by proving the following theorem: 

Theorem 1. Let <1> := {^ n }n=i denote a uniformly bounded by some constant A. Let 

e > 0, 7 > 1. Then, with probability at least 1 — o/V -7 (for some universal c), for a uniformly 
random permutation a : [N] — > [N], the system {4> a (n)}n=i w ^ satisfy 

||V 2 /I| 2 «A,,7 log^ +£ (iV)||/||2. 

It turns out that treating the V 2 operator requires a considerably more delicate analysis than 
the maximal and r-variation (r > 2) cases previously studied. Indeed, the Dudley-type chain- 
ing/covering number methods used in [3] and [TO] alone seem incapable of achieving anything 
better than ||V 2 /|| L 2 <C log 1/2 (A7")loglog(iV)||/|| L 2 (see [TO]). The limitations of these methods 
have been previously overcome in the context of related problems, most notably Bourgain's 
solution to the A(p)-problem [3], as well as Talagrand's alternate approach and generalizations 
|16j . The probabilistic component of our current work will use tools from both Bourgain and 
Talagrand's works, however additional complications enter as we will need to work with random 
subsets of a much greater density and derive stronger concentration bounds (see Section [7] for 
further discussion of these issues and an overview of this part of our argument). A careful 
combinatorial organization is also needed to reduce estimates for the V 2 operator to questions 
amenable to these probabilistic methods. Here we rely, in part, on ideas from Taylor's work 
[19j on the path variation of Brownian motion. 

We do not expect that the exponent is sharp. In the maximal case, it is known that 
Bourgain's estimate ||.M/||2 <C log log(iV) 1 1 / 1 1 2 is the best one can achieve with a purely prob- 
abilistic argument (see remark 2 in [3]). It is consistent with our knowledge that probabilistic 
arguments might be able to achieve ||V 2 /||2 "C log log ( A^) 1 1 / 1 1 2 ; although this would surely re- 
quire a much deeper analysis. In [TOJ (see Theorem 6) it was shown that for any bounded ONS 
one may find a function / such that ||V 2 /||2 3> \/log log(iV)||/||2, which gives a lower bound 
for any ordering. This fact is closely related to the law of the iterated logarithm (see [9]). 

2 Preliminaries 

We use the notation x <C y, for instance, to mean that there exists an absolute constant C such 
that x < Cy. We similarly employ notation like x <^. p y to mean that there exists a constant 
C p depending only on p such that x < C p y. We will use [N] to denote the set of the first iV 
natural numbers {1,2,..., iV}. 

1 We have defined an ONS to be real-valued, however the result follows for the complex-valued trigonometric 
system by applying the result to the real and imaginary parts separately. 
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Remark 1. Throughout this paper, we will consider orthonormal systems $ := {4> n }n=i un ^~ 
formly bounded by a fixed constant A, meaning that \4> n (x)\ < A for all n £ [N] and all x £ T. 
We will refer to these simply as bounded orthonormal systems. Since A is fixed, we allow 
dependence on A in all implicit constants (such as the asymptotic notations <C and O(-)) which 
we will not always explicitly state. 

We let T denote a convex, symmetric function r : R — > M. + such that T(0) = 0, T is increasing 
on R + , and T(t) tends to infinity as t tends to infinity. The (Luxemburg) Orlicz space norm 
associated to T is then defined by: 



Definition 2. For f : T 



We define the following convex function from R to R, parameterized by a value f < K < oo 
and a value 2 < p < 3: 

j \t\ p , if |*| < K; 

r K (t) :- j ^ + ^ Kp _ 2t2 _ ^ K ^ . f |t| > 



We also define 



|t| p - 2 , if |*| < K; 
KP- 2 , if \t\ > K. 



lK{t) := 

We observe that t 2 -f K (t) < T K (t) < (l + t 2 -f K (t). 

Lemma 3. For any fixed 1 < K < oo and 2 < p < 3, z* ZioZds for all s,t £R that 

\sjk(s) - t lK (t)\ < 3( 7x (s) + lK{t))\s - t\. 

Proof. Without loss of generality, we may assume that \s\ > \t\. We define j3 by t = /3s, where 
|/5| < f . For notational concision, we also define a = p — 2. We first consider the case where 
\s\, \t\ < K. In this case, we consider the quantity 

\s 1K {s)-t 1K {t)\ = \ s \ i+a {i-m a )- 

We must establish that this is upper bounded by 3|s| 1+a (I — /3)(1 + Since \j3\ < 1 and 

a< 1, we have \f3\ a > |/3|, and hence (1 - /3)(1 + |/3| a ) = 1 - /3 + - /3|/?| a > l-f3\/3\ a . The 
desired inequality follows (note that the factor of 3 is not needed for this case). 

We next consider the case where \s\ > K and \t\ < K. Here, we wish to bound the quantity 
\sK a - t\t\ a \ by the quantity 3|s - t\(K a + \t\ a ). We suppose that s > K > t > 0. Then, it 



suffices to show that 



which holds if and only if 



s |f<> _ /-!+<* 

<3K a , 



i-t 
t 1+a K- a 



< 3. 



s-t 

We define 0<c<lbyt = cK and a > by s = (1 + cr)K. Then we have: 

s - t 1+a K- a _ I + a - c a+1 

S — t I + <7 — C 



3 



We observe: 

1 + a - c a+1 1 - c l+a a 1 - c 1+a a 

— = 1 < h -. 

l + o" — c l + o" — c l + o" — c 1 — c a 

Since a < 1 and c < 1, we note that c 1+a > c 2 , so this is < 1 + c + 1 < 3, as required. 

We now suppose instead that s > K, and — K < t < 0. In this case, we need to show that 

H K a + \t \ 1+a 
s + |t| 

Dividing out if", we obtain the equivalent requirement 

s + \t\ l+a K- a 



s + lil 



< 3. 



Using that \t\ < K < s, we observe that — 1 s+ u\ < if = 2, and so the desired inequality 

holds. The cases where s < —K and \t\ < .K" can be handled symmetrically. 

Finally, we must consider the case where \s\, \t\ > K. In this case, we wish to bound the 
quantity 

\sK a - tK a \ = K a \s -t\ = hcf K {8) + lK{t))\8 - t\, 
so this is clearly < 3\^k(s) + 7jr(t)| ■ \s — t\ as required. □ 
Lemma 4. For K > 1, for all t we have that 

r K (t) < tp 
r K (t)< (i + n^) KP ^ 2 - 

It follows thatforf-.T^Rwe have \\f\\r K < \\f\\ p and \\f\\r K < (l + v -^)' 2 K^l 2 \\f\\ 2 . 

Proof. The inequality T K (t) < (l + S+ 2 -) i^^ 2 t 2 is clear from the definition of T K (t). We 

prove Tx{t) < t p . This is clear for \t\ < K, so we assume \t\ > K. We let < c < 1 be defined 
by c := — . ^ then suffices to show 

1 + EZ«V-f!^Wl. 



Setting the derivative (with respect to c) equal to 0, we see that 

• -i + ^) -M-<-<- ^) - 3 " (^) ^ 

This implies that (1 + 2 ^)c p_3 = fc^ 1 and it follows that c = 1. Lastly the inequality can be 
easily verified at c = 0, 1. 

Let / be a function from T to R, and let A = ||/|| p . Now, using Tx{t) < t p , 

J T K {f/\)<\\f\\-v J i/r<i. 

Similarly, setting A = (l + ^) i^ (p " 2)/2 ||/|| 2 and using T K (t) < (l + 2z?) ^P-2f2 ? we 
have 

-2 



M//A)< ((i + ^) 1/2 ^ 2)/2 ll/l| 2 j / (i + ^)^- 2 l/l 2 <i- 



□ 
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Given Tk, we also define its dual, T* K : R — > R, as 

1*0=) = f [A {T' K y l {t)dt 

Jo 

By a straightforward computation, we have for t > 0: 

(r'K) _1 (t) = 



t/ ( P KP- 2 ), iftypRp- 1 . 



We then compute that for < x < p.fr p 1 



T* K {x) = J I -J dt = p ~{p-l)x~. (1) 



For x > pK p 1 , we compute 



We call the dual of Yk because || • ||r* is the equivalent to the dual norm of || • \ \r K - 
More precisely: 

Lemma 5. There exist (universal) positive constants C\,Ci such that, for all f, 

d sup ffg< \\f\\r K <C 2 sup ffg. 
\\g\\ r * K <lJ llsllr^<l^ 

This follows from the standard theory of Orlicz spaces (see Chapter 2 of [7], for instance). 
Lemma 6. For any measurable f : T — > R, we can decompose f = f\ + f% such that 

H/ilk « \\f\\v K and \\h\\ L . « K^ 2 \\f\\ TK . 



Proof. By homogeneity we may assume that 1 1/| |r x = 1- We let Is denote the indicator function 
of a set S C T. We now define 

h ■= f ■ \ x:[ £M [<K} and h ■= f ■ K {a;: |/M M - 
Using the hypothesis that ||/||r K = 1> we have 

/ r*(//2) = / {f/2Yl {x . m<K}+ J ((l + ^) KP-\f/2? - (^) Kp) I 



{x ;\il2l\>K} 



It follows that 



which is equivalent to ||/i|| p < 2 (or ||/i|| p < 2||/||r K ). Next we have that 



l + P —^\KP-\f/2?-( P —l)KP)l 



{x: \£M\> K} 



/( 



1 + ^) ^(//2)' I{I „ZM |SK) - , ({« : |^| > K}) ^ < 1, 



where \i denotes the Lebesgue measure. 

Since we are assuming H/Hr^ = 1 an d ^K{f{x)/2) > K p whenever \f(x)/2\ > K, we must 
have n (lx : > Kf) — K~ p . Combining this with the above inequality, we see that 



1 + 



^(//2)V,M 



{xW|>K} 



< 1 + 



p-2 



This implies 



y (/ 2 ) 2 < 4K 2 - p . 



□ 



Lemma 7. We tae ffcttf ||/|| 2 < pKV- l \\f\\ v * K . 

Proof. Referring to the computation of T* K in ([2]) , we see that 



Let 



1, then 



2pKP 



2pKP~ 2 ' 

(f)f 2 < / r* K (f) < l. 

* Jt jt 



(3) 



We also note that 
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\ 2 = f I< pKP -i(f)f+ [ l> pKP -i(f)f 2 < P 2 K 2p - 2 + [ I> pKP -i(f)f 2 . (4) 

JT JT JT 

Combining © and ©, we have ||/|| 2 2 <^p 2 K 2p ~ 2 . 

□ 

We will now recall several probabilistic results that we will need later. The following lemma 
(see p. 229 of [2]) is the chaining argument from Bourgain's work on the A(p)-problem: 



Lemma 8. Let £ C M. N and B := sup xg £ 1 1 a; 1 1 ^2 (the diameter of £). Let < 5 < 1 and {Ci}iL\ 
independent 0, 1-valued random variables (selectors) with mean 6 = J £i(uj)duj and 1 < m < N. 
Then for any q > 1, 



sup 

x&£,\S\<rn ieS 



Li{dw) 



5m 



log(l/<5) 



1/2 + log" 1/2 (lA) I* \og x '\N 2 {S,t))dt. (5) 

JO 



Here, N2(£,t) denotes the entropy number of £ with respect to the ^-distance. In other 
words, N2(£, t) is the minimum number of ^ 2 -balls of radius t needed to cover the set £ . 

The following lemma (see p. 231 of [2]) is the entropy estimate from Bourgain's work on 
the A(p)-problem. There it is stated for orthonormal systems uniformly bounded by 1, but it 
generalizes easily to those uniformly bounded by a fixed constant A: 

Lemma 9. Let $ := {4>i}f =l denote an orthonormal system of functions uniformly bounded by 
A. Further let 1 < m < N and 2 < q < 00 and define 



V m := < y%, 
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Then, for some v q > 2, 



{ N \ 1 
log (N q (V m , t)) < A C q m log — + 1 r v « forty - 

\m J 2 

log (N q (V m ,t)) < A C q m log f — + 1 j log(l/t) /or < t < ^. 



(6) 



(7) 



The following lemma appears as Theorem 5.2 in [16| (see also [T7] and [18] ). This deep 
and general result is a key technical component of Talagrand's alternate approach to the A(p)- 
problem and is closely related to what is often called the majorizing measures theorem. 

Lemma 10. Consider an operator U from l 2 N to the Banach space of real valued functions on 
T with a norm \\ ■ \\. Further assume that \\ ■ \ \ < || • || p for some p > 2. Now consider N 
independent mean 5 selectors {£i}£Li and consider the random subset S := {i € [N] : £j = 1}. 
Then, if we denote by U\s the restriction of domain of the operator U to sequences supported 
on S, we have that 

i2 + C p log(iV) 



*[\\U\t 



io g (i/a) 



(8) 



The following lemma appears as Theorem 7.4 in [8] and provides a strong concentration 
bound for an empirical processes. This is also due to Talagrand [15], although from work on a 
somewhat different topic. 

Lemma 11. Let Y\, . . . , lj\r denote random variables taking values in a Banach space W, and 
T a countable collection of measurable functions on W pointwise bounded by C (i.e. \f \ < C). 
Set Z := sup /g jrX]iIi f( Y i) and a 2 = sup /eJ rX]iIi \ f( Y i)\ 2 - Then, for all t > 0, 

Ct\\ 

(9) 



P (\Z - E(Z)\ >t)< 3exp f -— log + ^ 

for some absolute constant k. 

We will also need the following standard fact: 

Lemma 12. Let {Vy)y^E be a set of real, non-negative random variables indexed by a finite set 
E of size N . Let q > 1. Then 



sup V v 



< SUp||Vy||g + l 0g Ar. 

y 



Proof. We define Z := sup y Vy and let n > 1 be a parameter to be set later. Applying Jensen's 
inequality, we have (E[Z]) ,? < EfZ'']. We note that Z v < ^yVy 71 , so by the linearity of 
expectation we have 

(nz]r<j2nv y qr >}<N S up\\v y \\^. 

y y 



We then observe: 



sup V y 

y 



(E[Z])« < Nm sup ||V, 



y\\qv 



We then choose r\ so that rjq = q + log N. We then have < N l °s N 1, so the lemma 
follows. □ 



Finally, we recall Pittel's inequality, which can be found on p. 38 in pQ, for example: 



7 



Lemma 13. For positive integers 1 < m < N, we let (^) denote the set of all subsets of [N] 

of size m. We let S m denote a uniformly random element of (^). We let S denote a subset 
of [N] chosen by including each element of [N] independently with probability 5 = m/N. Then, 
for any event E described as a set of subsets of [N], we have 





< ^fm ■ P 


See 


\ m J 







3 Dyadic Decompositions 



Let <E> = {4>i}i = i be an orthonormal system and fix / = J2 n e\N] a n (f> n in the span of the system. 
We now perform a dyadic decomposition of / in terms of the £ 2 weight of the coefficients (which 
we refer to as 'mass'). We reproduce the description of this decomposition from |10j . 



We define the mass of a subinterval / C [N] as M(I) := 



■. By normalization, 

we may assume that M([iV]) = 1. We define 10,1 := [N] and we iteratively define Ik >s for 
1 < s < 2 k as follows. Assuming we have already defined Ik—is f° r all 1 < s < 2 fe_1 , we will 
define Ik2s—i an d ^fc,2sj which are subintervals of Ik— is- Ik2s—i begins at the left endpoint 
of Ik—x,s an d extends to the right as far as possible while covering strictly less than half the 
mass of Ik—is-i while Ik 2s ends at the right endpoint of Ik— is an d extends to the left as far 
as possible while covering at most half the mass of Ik— is- More formally, we define Ik2s-i 
as the maximal subinterval of Ik-i, s which contains the left endpoint of Ik-i, s an d satisfies 
M(Ik,2s-l) < \M{Ik-\,s)- We also define Ik,2s as the maximal subinterval of Ik-i,s which 
contains the right endpoint of Ik-i, s an d satisfies M{Ik^s) < \^{Ik-i,s)- We note that these 
subintervals are disjoint. We may express Ik-i, s = Ik,2s-i U Ik,2s U ^k,s, where ik )S £ Ik-i,s- in 
other words, ik s denotes the single element which lies between Ik2s-l an d Ik,2s (note that such 
a point always exists because we have required that Ik,2s-l contains strictly less than half of the 
mass of the interval) . Here it is acceptable for some choices of the intervals in this decomposition 
to be empty. By construction we have that 



For J C \N], we define 



We also define 



M{h,s) < 2-K 
fj(x) = ^2a n <p n (x) 



(10) 



fj(x) := max 



y^a w n (a 



nel 



where max/cj denotes the maximum over subintervals. 

Lemma 14. Let $ = {(f)i}iL± denote an orthonormal system such that \4>i(x)\ < A for all i,x 
and for every f = Ylne[N] a n4>n in the span of the system one has 

H/||iV<A||/|| 2 

for some constant A > 2. Then we may decompose the maximal function f = f\ + fi where 

||/i|| P « P A||/|| 2 
||/ 2 || 2 « p Alog(iV)K( 2 -^ 2 l 
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Proof. We normalize ||/|| 2 = 1 and split f = fi + fs where f L := ^2\ an \ >N -i a n <j) n and f s := 
E|o„|<iV-i a n4>n- Now / < f L + f s and f s < E| a „|<JV-i \ a n\\4>n\ < A. Clearly ||/ 5 || p < ||/|| 2 , 
so it remains to prove the desired properties for f^. 

We next perform the 'mass' decomposition described above on f^. On each interval Ii- S in 
the dyadic decomposition of fz, we can apply Lemma [6] and the hypothesis ||,f||rv < A 



l/p 



to express the restriction of fi to 1^ s as the sum of two functions Gk s an d Ek s such that 
||Gjt jS || p < AH/4 JI2 and | |^fe,s 1 12 < AK^'^^l \fi ks \ | 2 . Now, at each point x, the maximizing 
subinterval achieving the value fzix) can be decomposed into a disjoint union of dyadic intervals 
(and points) using at most two intervals on each level. We then have the pointwise inequality 

riog(JV)l / \Vp / \Vp 

h< E Ei^+^r +E Ei/^i p 

k=l V s ) k \ s ) 

flog(JV)l / \ VP riog(JV)l / \ VP / \ 

E Ei g mM + E Ei^r +E 

fc=l V s / fc=l V s / fc \ s / 

where we have used the condition \a n \ > N' 1 to restrict the sum in k to |~log(iV)] values. (We 
have also used that taking the £ p -norm of the values of the intervals on a single level provides 
an upper bound on the highest value taken on that level.) 

Note that it suffices to prove that an appropriate decomposition exists for a pointwise 
majorant. We treat each of the three sums on right of the above inequality separately. By 
hypothesis, we have that ||G fc , s || p < A\\f Ik J\ 2 and \\E k , s \\ 2 < Ai^( 2 ^/ 2 ||// fc J| 2 . We then see 
that 



[log(iV)l 




riog(jv)i 




E 


(?' GM *f 


£ E 




k=l 




k=l 

P 





Rog(A01 / 

E Eiig 



/•slip 



i/p riog(iv)i 

< J2 A2 fe ( 1 /P-i/2) <p A 



k=l 



k=l 



Here we have used that 
Next we have 



< 2- k ' 2 . 



flog(JV)l / 

E 5>*.« 



i/p 



k=l 



|Tog(JV)l / 

< E Eii^ 



< 



1/2 




1/2 



< Alog(iV)if (2 ~ 



P)/2 



fc=l 



Recall that /, 



j for some j G [iV]. By hypothesis we have | (x)| < A for all x, 



furthermore \aj\ < 2 ( k x )/ 2 since j G Ik— is' f° r some s'. Thus |/i fe J <Ca 2 ^ x ^ 2 . Hence, for 
any x, 



i/p 



E E i/U*)i p « E E 2 ^ 2)(fc ~ 1)/2 i^>)i 2 « ^2- (p - 2)(fc - 1)/(2?3) «p i- 



Since this term is pointwise bounded, clearly its L p norm is also bounded. Since the decompo- 
sition holds for each of the three terms it clearly holds for their sum, completing the proof. □ 
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We now record the following easily verified fact (see Lemma 29 in [10J ) . 

Lemma 15. We fix \ \fW2 = 1 (in the span of {4>i}f =l ) and let A denote the set of intervals 
that occur in the mass decomposition of f. For every interval J C [N], (J ^ %), there exist 
Ji, J r G A and ij G [N] such that J := Ji U i j U J r is an interval (i.e. Je,ij, Je are adjacent), 
J C J, and M(J) < 2M(J). 

Lemma 16. Let $ = {4>i}f =l denote a bounded orthonormal system. Furthermore, let M < N, 
let C',r > be positive constants, and let K > 2. We assume that for any interval I C [N] of 
length \I\ < M we have for any h = J2nei a n4>n, the estimate 



C'log r (N) 
log r (JV/|J|) 



holds. In addition, we assume that 

||A^/|| 2 «loglog(iV)||/|| 2 

holds for any f in the span of the system. Then, if f = Ylnei a n4>n for \X\ = M < N, for any 
(3 > we have 

\\V 2 f\\ 2 < P (log^ 2 (iV) + {Cy^log^-^-^^iN^og^-^^iN/M) 

+ C'\og r+ \N)K^/ 2 + \og\og{N))\\f\\ 2 . (11) 

Proof. Let / = £ neX a n4>n with \X\ — M and X C [N\. Normalize H/H2 — 1, and again 
split / = /l + fs where f L = T,\a n \>N-i a n4>n and f s = E|a n |<Af-! a n4>n- We then have 
V 2 fs < Wn4>n\ 1 and may restrict our attention to /l. 

We now perform a mass decomposition on fi, using the notation above. Let I be a dyadic 
subinterval. We consider the restriction of fi to this interval, and we apply Lemma Q3] to 
decompose the maximal version of this restriction as a sum of functions Gj + Ej. We may 
apply Lemma [HI to the system {0j}j g / with A := ^"^^ . We then obtain 



a iog r (iv) 

" p log r (iV/|/|) 



\Gl\\p i TV at- /I m ll/j||2) 



For a fixed x G T, we know that the value of V 2 /l(x) is achieved by some maximizing 
partition tt. Appealing to Lemma [To] we know that each interval in tt can be covered by dyadic 
intervals and points (and each dyadic interval/point will only be used to cover at most a constant 
number of the original intervals). Let A denote the family of dyadic intervals and points. We 
write tt* C A to denote a suitable covering of a partition by dyadic intervals and points. For 
each k, we define 1% := {I M s.t. |J M | < 2- fc / 2 M} and i| := {J fcjS s.t. \I k>s \ > 2~ k / 2 M}. We let 
I a denote the union of Ii over the values of k let I b denote the union of l\. We then have 

|V 2 / L (x)| 2 «sup^|/ 7 (x)| 2 « sup l^|//| 2 + Y, !/<*.. 



<C sup ^ |/ 7 | 2 + sup \M 2 + SU P S 

-* c ^/e^ -* c ^/e^ ^ c ^ fca e^ 
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The third quantity here can be easily handled. Since fc k = ai k s 4>i k and Yli a i = 1) the 
contribution from the points here is <C 1. The second sum may also be estimated in a crude 
manner. Recall that ||// fc J|| <C (log log(iY)) 2 2~ fc (by assumption) and note that < 2 fc / 2 . 
Thus 



/ sup ^|//| 2 «^ Y. (loglog(iV)) 2 2^«(loglog(iV)) 2 ^2^/ 2 «(loglog 



(AT)) 2 . 



6 -fc.sfcJfc 



We return to the first sum. We fix f3 > 0. For each dyadic interval /, we define the 'bad' set 
Bj = {x € T : |G/(a;)| 2 > log^(iV)M(J)} and we let Hj denote its complement inside T. Then, 



/ sup ^|//| 2 « / sup Y,\Gi + Ei\ 2 
i&i a iei a 

« / sup V \l Hl Gi\ 2 + [ sup V \IbA\ 2 + [ sup V \Ej\ 2 . 

j &I a j &I a j eI a 

By Definition, |I Hj Gj(a;)| 2 < log? (N)M(I) for all x, and since the intervals in the partition 
7r are disjoint, we have that J2i£n* <C 1. Thus 

/ sup V \I Hl G!^)\ 2 < 
iei a 

Next, using that / |^ fc , s | 2 < P (C) 2 log 2 ( r+1 )(iV) \og~ 2r (N /\I kyS \)K 2 -P M(J M ), we have 

/ sup V |^| 2 « p (C') 2 log 2r+2 (iV)^ 2 -P. 
J **cA Ien * 

I<El a 

It remains to estimate the contribution of the bad sets. By Chebyshev's inequality, we have 

f % < o < p (C") p log pr (Af)log- pr (iV/|4 s |)log- /3p / 2 (iV). 

J ~ log^/ 2 (iV)(M(I M ))P/ 2 py ' K ' K n ' U V ^ 

Now, by Holder's inequality, we have 

r / r \V(p/2)' / r \2/p 

J K,A S | 2 < [J W.J (J \Gk,s\ P J 

where (p/2)' denotes the conjugate exponent of p/2. This is 

« p ((CT log^(iV) log^(iV/|/ M |) log-^ 2 (iV)) V(P/2) ' (C'f log 2r (N) log~ 2r (N/\I kyS \)M(I k , s ) 

= (C'f log^(TV) log-^(iV/|4 iS |) \og-^l 2 (N)M{I Ks ), 

using that l/(p/2)' = (p — 2)/p. Since |/fc jS | < 2~ k l 2 M for I k s G I a , we have that (assuming 
pr > 1) 

r /log(JV) \ 

/ Yl KA-I* « P (C') p log^(^V)log^ (p - 2)/2 (iV) £ (log(iV/M) + 
k,s,i k , s ei a \ fc=i / 

/»oo 

< p (C") p log pr (A^)log- /3 ( p - 2 )/ 2 (7V) / x- pr dx < p (C / ) p log pr (Af)log- /3 ^ 2 )/ 2 (Af)log 1 - pr (A f /M). 

Jlog(N/M) 

□ 
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Corollary 17. With the same hypothesis as Lemma\M if C \og r+1 (N)^ 2 ^/ 2 < 1, \og(N/M) < 
log e (iV), and r + 0/p - #r > 0, then 

l|V 2 /l|2« P ,c log r+9/p - er (iV)||/|| 2 

for all f = J2nel a ^ with PI = M - 

Proof. By hypothesis, the third term in ([lip is <C 1, so we only need to choose a /3 > to balance 
the first two terms. Solving /3/2 = rp/2 — (3{p— 2)/4+0(l— pr)/2 we see that (3 = 2{r + 9/p—9r). 
Taking this choice of f3 in ([lip yields the corollary. □ 

At the beginning of this section we introduced a 'mass' dyadic decomposition of [N] with 
respect to a function / = J2 n e[N] a «0n- Now we recall the more common 'length' dyadic 
decomposition. Without loss of generality, we assume N = 2 l for some positive integer £ (if iV 
is not a power of 2 we will simply round up to the nearest power of 2). Consider the collection 
of dyadic subintervals of the form Xfc jS = (s2 k , (s + l)2 fc ] for each < k < £, < s < 2 e ~ k — 1. 
Note that the we have used a slightly different indexing convention here, compared with the 
mass decomposition. 

Lemma 18. Let J C [N] be an arbitrary subinterval. Then we may decompose J = JiL) J r as 
a union of disjoint intervals Ji,J r where (i) at least one of Ji and J r is non-empty, and (ii) 
Ji Q T-k,s f or some k,s where |J/| > ^|Ifc iS | (assuming J\ is non-empty), and the same holds for 

Proof. We define b G J to be of the form m2 k where k is maximal. In other words, we consider 
all multiples of powers of 2 inside J, and we set b to be one of the multiples of the highest power 
appearing. We set Ji := {1, . . . , b} D J and J r := {b + 1, . . . , N} n J. We consider | Jj| and we let 
k\ be the minimal integer such that 2 kl > |«7j|. Then we must have k\ < k (otherwise, a multiple 
of 2 k+1 would appear in J, contradicting maximality of k). Thus, there is a dyadic interval of 
length 2 kl that covers Ji and has length < 2\J\\. An analogous argument applies to J r . □ 

As above, if f(x) = J2 n e[N] a n*/ ) n( x ) ^ s fixed and I C [AT - ] is an interval we will write 

// = £ ne / fln^n and // — maxjc/ | £neJ a n^n(2;)| (where the maximum is over all subintervals 
of I). 

Lemma 19. Let & = {(fri}iLi denote a bounded orthonormal system such that 

||W|| a <loglog(iV)||/|| 2 

holds for all f in the span o/<I>. Suppose we have < 9 < 1, r > ; andp > 2 such that for each 
subinterval I C [N] of length M = 2 m where M is the largest power of 2 that is < N2~ log ( N \ 

||V 2 /l|2«log r+e/p - er (^)ll/ll2 

holds for any f = £ neJ a n (p n . Then, for any f = JZ n e[N] a n<\>n, we have that 

||V 2 /|| 2 « (log e / 2 (iV)loglog(iV) +log r+e ^(iV))||/|| 2 . 
Ln particular, when 9 = 1/2+r-i/p ' we °^^ a * n 

||V 2 /|| 2 «log^^(Ar)loglog(iV)||/|| 2 . 
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Proof. Setting f(x) = J2 n e[N] a n4>n{x), we may normalize 
to estimate the quantity 



G[N] a n 



max > 

ZS7T 



1. We now wish 



(12) 



where Pa? ranges over the set of all partitions of [N]. From the elementary inequality (a + b) 2 < 
3(a 2 + 6 2 ) we have that | J2neJ a n<t>n{x)\ 2 < | Y] ng j ; a n</>nO*0| 2 + I EneJ r a n0nOc)| 2 whenever 
J = JiU J r for disjoint intervals Ji,J r . Using Lemma [T8l it follows that we may restrict the 
set of partitions Vn in (H2J) to the subclass V* N of permutations where each interval in each 
partition is contained in a dyadic interval of at most twice its length. That is, 



max > 

,,-p-p* Z — / 



y^q n TO (a;) 

nG/ 



For a fixed x, let 7r(x) denote the partition in achieving the maximum in the above 
expression at x. We then have that the above quantity can be expressed as 



E 

Ie-rv(x) 
\I\>N2- 21 °s ( N ) 



y~]a n (l>n(x) 



+ 



E 

|/|<Af2- 21 °s e ( JV ) 



B 1 + B 2 



We now consider the contribution from the quantity B\. Let / E 7r(x), since ir(x) G "P^ 
there is a dyadic interval J/, such that |I| < \ Jj\ < 2\I\. Since the intervals in n(x) are disjoint, 
and the associated dyadic interval Jj has length at most 2|/|, it follows that any particular 
dyadic interval is associated to at most 2 intervals in tt(x). We recall £ = log(iV). It follows 
that 

E E l4, s (x)| 2 «log e (iV)(loglog(iV)) 2 , (13) 

0<k<2E e 0<s<2 e ~ k 

where we have used that / |//| 2 <C / \Mfi\ 2 <C (log log(iV)) 2 Yln&i a n- 

Next, we must estimate the quantity B 2 . Let M = 2 m denote the largest power of 2 less 
than or equal to N2~ Thus i^-^W < M < N2~ lo & 9 ( N ). Consider the partition 

of iV into (dyadic) subintervals of length M, {2m,s}s=o _1 - ^ now follows from the hypothesis 
that on every interval in this partition X^g, we have ||V 2 /||2 <C log' r+£ '/ p ^'(^Oll/lb for any 

/ = £n G J m , s a "^"- 

Next we note that each of the intervals in the partitions in B 2 is contained in a dyadic 
interval of length at most 2 x _/V2~ 21og < M. By the nesting of dyadic intervals, it follows 
that each interval in the sum defining B 2 is strictly contained in an element of the partition 
{Zm^s}. Thus one has 



5 2 «^|V 2 /l m J 2 «log 2 ('- 



-0/p-Or) 



{N)\\f\\l 



(14) 



Combining [13] and [TJ] (and taking square roots) we have that 

||V 2 /|| 2 « (log e / 2 (iV)loglog(iV) +log^/^(iV))||/|| 2 . 



□ 
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4 Probabilistic Estimates 



4.1 A First Estimate 

We begin by establishing a close variant of Lemma 3.4 in [JJ, mainly following the proof in [3] 
having taking more care with logarithmic factors. 

Lemma 20. Let fa, . . . ,<j>^ be orthonormal functions on a probability space T,fi satisfying 
\fa\ < ^4 everywhere on T for all i. Let < 5 < 1 and let {£i}iLi be independent, {0, 1} -valued 
random variables of mean 5 on a probability space fi. Let 1 < K < oo, 2 < p < 3, 1 < m < iV 
and qo > 1. Then there exists a 



1 p-2 

A <C„ 8±K 2 + 



log(V 



+ 



logiV \ 2p 



iog(i) 



sup / 

|5|<m JT 
| Cl | 2 <2/m 



< 1. 



LV) (duj) 



Proof. For a fixed A > (to be set later) and any fixed well and W of size < m and coefficients 
Cj satisfying |cj| 2 < 2/m, we observe that 



) « A- 2 • -U sup V |(&,<ra-(A _1 0)>| , (15) 



where 



:= S 5 = ^2 difa | ^ \ai\ 2 < 1 and IS 1 ] < m > 
I i£S i J 



We define £ CR N as 



£ 



■■={(\(fa,9jK(x~ l g))\)ti \ 



We also define B := J2xe£ ll x ll^ 2 - 

Then, applying Lemma we have 



sup V &H • \(fa,gjK(^ 1 g))\ 
g^r m ,\w\<m ieW 



LW (dw) 



« Um + 



qo 



B + I log | - 
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(logA^i)) 2 ^, 



(16) 



where N2(£,t) denotes the entropy number of £ with respect to the £ 2 -distance. 

We now derive an upper bound on B for our set £ . We note that for any g G V m , we can 
apply Bessel's inequality to obtain 



Y.K^giKix-'g))^ < / M 2 |7*(A- 1 <?)| 2 
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Using the fact that \<pi\ < A, we have that for all x G T, \g(x)\ <C ^J a «l — °y the 
Cauchy-Schwarz inequality. Thus, since ||<?||l2 < 1, we have B <C {^jf)' 

Next we address the quantity N2(£,t). For arbitrary g,h G P m , we consider the quantity 

i i 

(K^t^A"^)}! - 1<&, ^(A- 1 ^))!) 2 ^ 2 < \^91k{x- x q) - h lK {\- l h))\^ 2 , 

using the fact that for any real numbers a and b, \\a\ — \b\ \ < \a — b\. By Bessel's inequality, this 
quantity is < ||57x(A" 1 g) - /i7x(A _1 /i)| \ L 2 . 
Applying Lemma [3l we see this is 

« ||b- h\ ■ hK(\- 1 g) + 7K(\- 1 h)\\\ L2 < X-^\\\g - h\ ■ (\g\P~ 2 + W- 2 )\\ L2 . 

Noting that \g\ p ~ 2 + \h\P~ 2 < 2(\g\ + |/i|) p ~ 2 , this is <C A"^ 2 ) \\\g - h\ ■ (\g\ + \h\f- 2 \ | £2 . Now 
applying Holder's inequality with conjugate exponents and we see this quantity is 

— / \ — 

<<A ~ {p 2) W 9 ~ k{ j 2 ■ (X (H+|/i|)2 ) 2 <<A ~ (p " 2) i^- /i ii3^- 

By a change of variable, we then have 

poo 1 /-oo . .1 

/ (logN 2 (£,t))idt<.\-(P-V (log N^(V m , z)) 2 dz. (17) 
Here, N 2 (P m , z) denote the corresponding entropy numbers of V m considered as a subset of 

3-p 

2 

the space L 3 ~p(T, n). 

Applying Lemma O we obtain 



jf (iog^w„,*))' « < c pv ^ (io g (| + (J y'log (i) + . 

where C p and z/ p denote values that depend only on p. We note that u p > 2 for 2 < p < 3. 
Thus, we deduce 



^ (log N_i_(V m ,t)Y dt < C P V^ (log Q + l) 2 (18) 

for some constant C p depending only on p (we have abused notation a bit here, as this is not 
the same Cp as in the previous statement). 
Combining (fT7|) and (fTBj) . we see that 

(log N 2 (£, 0)5 dt < C p • A~(p- 2 ) • (log + l) ) 2 • (19) 
We can now use this to obtain the following bound on the righthand side of (fT6l) : 

^h^A^h -- ■ ^ G) ) " 4 ( tos (£ + ) 4 ■ 
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Combining this with (|15|) . we conclude that 



sup 

\W\<m 
\ci\<2/m 



I 

J J 



T K 



A 



LiO (duj) 



<7o 



mlog (V 



A 



+ C p • A 



log (I) 



This quantity will be < 1 for a choice of A that is 



1 p-2 

A <„ 5iR— + 



log(|) 



+ 



log AT 



2p 



log 



1> 



□ 



4.2 Moving to General Coefficients 

Lemma 21. Xei . . . ,<pN be orthonormal functions on a probability space T, /i satisfying 
\4>i\ — A everywhere on T for all i. Let < 5 < 1 and let be independent, {0, 1} -valued 

random variables of mean 5 on a probability space £1. Let 1 < K < oo, 2 < p < 3, 1 < M < A" 
and qo > 1. T/ien i/iere exists a 



A < p ^AT^vdogM + 



go + log log M 



VlogM + 



log AT 

Ml) 



2p 



VlogM 



such that 



UK 



sup 



2<1 



A 



|sMppart({ai})|< M 



< 1. 



L90 (da,) 

Proof. For any fixed values {a,} with support size < M such that ^ a? < 1, we divide them 
into logM levels corresponding to powers of 2. More precisely, for each m that is a power of 2 
that is > 1 and < M, we let S m denote the set of indices i such that l/m < |aj| 2 < 2/m. For 
m = 2^ M \ we define S m to be the set of all indices i such that |aj| 2 < 2/m. We then have 
that [TV] is the disjoint union of S m as logm ranges from 1 to [~logM~|. Note that \S m \ < m for 
each m. We also define d m = Yli^s m l a »l 2 ^ or eacn m - We note that ^ m d m < 1. For notational 
convenience, we also define the quantity L> := \/^m- 
We then observe (for a fixed w G il) 



M x 



K 




Appealing to the convexity of and the linearity of the integral, we see this is 



< 



'in I 

If 



D 



^2 a i&( U ) 
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For each ?7i j we define the random variable Vd rn as: 



VD,m{u) = SUp 
|A|<m, 
\d\ 2 <2/m 



We now have 



sup / r K 

\\{<H}\\ e 2<l JT 
| support({ai }) | < M 



A 



L'o (du>) 



< 



sup £> 1 ^ y/d^Vp' 



2m|| £ 2< 1 



L9o (do,) 



We note that L> = ^ m \/^m y/log M by the Cauchy-Schwarz inequality (recall there are 
only [logM] values of m). Therefore, since VD,m is non-decreasing as a function of D, the 
quantity above is 



\fa\n SUp Vb,m 


< 


\ m / 





sup^Vci 



L"Jo (did) 



Applying Lemma [T2l we see that this is 



<C sup 



sup 

\A\<m, 
\d\ 2 <2/m 



L-Jo+log LogM (dw) 



By setting 



A <C P JJ^Vy/logM + 

and applying Lemma |2"01 we obtain the result. 
4.3 Obtaining a Good Partition 

Lemma 22. Let <fi\, . . . , 0jy 6e orthonormal functions on a probability space T,/i satisfying 
|<fo| < -A everywhere on T for all i. Let 1 < M < L < N and 7 > 1. ZioWs mf/i probability at 
least 1 — cN~~ ( (for some universal constant c) that a subset L of N of size L chosen uniformly 
randomly satisfies 

sup / T K I — ^ I < 2 

HK}|| <2 <i 

support({cii})<M 

for a choice of A that is 



q + log log M 



log(V 



A/log M + 



log AT 

Mi) 



1 



VlogM 



□ 



A 



(20) 



^«-(l) J ^^ + iS) 



A/logM + 



log A 



1 

2p 



VbgM. 
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Proof. We fix a value of A suitable to apply Lemma [22 (with 5 = L/N and go = 27log(iV) 
fixed). For a real value t > 1, we let Et denote the event that a set S of size of L selected 
uniformly at random from [N] satisfies 



sup 

IIK}]| f 2<l 
support ({a,i})<M 



A 



< t. 



Similarly, for independent selectors with mean 5 = L/N, we let E' t denote the event 



that 



sup 

II{«H}[|£2<1 

support ({oj})<M 



A 



< t. 



We begin by obtaining a lower bound on the probability of the event E' t using Lemma [2T1 
We have by Chebyshev's inequality: 



sup 

\\{ai}\\ e2 <lJ'~ 



A 



> i 



< t- qo . 



[Eft > 1 - t" 90 . 



Hence, 

Next we observe the relationship between ¥[Et] and P [-£>£]. By applying Lemma [13] to the 
complements of the events Et and E' t , we have 1 — P[£t] <C \/£ • (1 — P[^]). This implies that 

¥[E t ] > 1 - CVl ■ t~ qo > 1 - C^N • t- 2 T 1 °s( JV ) 

for some positive constant C. 

Thus for a uniformly random subset of size L, the probability of the event E2, that is that 
(1201) holds, is at least 1 - cN^. 

□ 

Corollary 23. Let {4>i]f = i be a bounded orthonormal system. Let 1 < L < N and 7 > 1. It 
holds with probability at least 1 — cN~~' (for some universal constant c) that a subset of S C [N] 
of size L chosen uniformly randomly satisfies: 



sup 

\\{ai}\\ e2 <l 
support({a,i})<M 



E' 



(Ar/i) l/(2p-4) 



logiV 
log(f) 



•v/logM 



for all M in the range 1 < M < L. 



Proof. We apply the previous lemma with parameters 7 + 1 and K := (^) 2p ~ 4 and employ a 
union bound over the < TV" values of M. We note that once we have a A such that 



E 



A 



< 2, 



we can multiply A by a constant to obtain an upper bound on the r^-norm. 



□ 



Corollary 24. Let {4>i]f =l be a bounded orthonormal system. For any 7 > 1, it holds with 
probability at least 1 — cN~" / (for some universal c) that a uniformly randomly selected permu- 
tation a : [N] — > [N] satisfies the following. If I is an subinterval in [N] then (for all M < \I\) 
we have 



sup 

ll{«i}||<a<l 

support( {cii } ) < M 



(JV/|J|)V(2p-4) 



logiV 



^log M 



(uniformly in the choice of interval I and M). 



18 



Proof. We apply the previous corollary with the parameter 7+2 for each interval I with L := \I\. 
We then employ the union bound over the ~ iV 2 intervals. Note that for each I, the image of 
/ under a randomly chosen permutation is equivalent to a randomly chosen subset of size |/| 
inside [N]. 

□ 



5 Bourgain's Theorem 



The estimates in the previous section are not strong enough to deduce Theorem [TJ however they 
do allow us to reprove Bourgain's theorem and show that it holds with large probability, a fact 
we will require later. 

Proposition 25. Let {<f> n }^ =1 denote a bounded orthonormal system. For any 7 > 1 it holds 
with probability at least 1 — ciV~ 7 (for some universal c) that a uniformly randomly selected 
permutation a : [N] — > [N] satisfies 

||AWI|2< 7 loglog(iV)||/|| 2 

where M a f{x) := maxi< N X)n=l a n4>c(n) ( x ) f or al1 f = J2ne[N] a n4>a(n) in the span of the 
system. 



Proof. We select a permutation a satisfying the conclusion of Corollary 



We fix / 



^2ne[N] a n ( Pa(n) and we consider intervals Ik )S and points ik )S m a corresponding mass de- 
composition of [N]. For each k, we define It to be the collection of intervals It, s such that 
\h,s\ < 2~ k / 2 N and J| to be the collection of intervals Ifc jS such that \Ik, s \ > 2r k l 2 N . We 
observe that < 2 k / 2 . 

At each fixed point x G T, the value of M a f(x) is achieved on some subinterval of [N] that 
can be decomposed into a union of dyadic intervals Ik, s and points s such that there is at most 
one interval Ik s f° r each value of k. We let k* := [15 log log(iV)] . We then have the pointwise 
inequality 

V /2 / 

|2 



M a f{x) « E 



1/2 



E lA*,. 



\h, s ei b k 



J 



k<k* 



+ E I E \Ks 



I 



+ ^max|/ ife J + 



l/p 



E 

, l<s<2 fe * 



1/4. J 



J 



1 

12 \ 2 



To see this, note that yY2i k \fh s\ 2 J * ^ s an u PP er bound on the largest value of \fi ka \ over all 

s such that /fc jS S I? for each k (for example), and the final term above captures the contribution 
from dyadic subintervals for values of k > k* . 

We now consider the L 2 norm of each of these terms. Recall that f\fi ks \ 2 < 2 _fe and 

< 2 fc / 2 ; thus we have 





( \ 


1/2 




f \ 


1/2 


E 


E 1/4J 2 




sE 


E H/jfc,*lli 


<^ 2 - fc/4<<1 


k 






2 
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Next, 



k<k* 





1/2 




E \Kf 


1 








2 



1/2 



E l-^M 

a<s<2 fc 



< log log(iV). 



We also have that, using that |^ n (x)| < A and ||/i fc3 ||2 < 2 , we have the pointwise bound 

£max|/U«^2- fc / 2 «l. 



Finally, we must consider the quantity 

/ 



E 



Ks<2 



fc* 



/ 



Using Corollary[2^1and Lemma [HI we may decompose L = Gi k „ a +Ei k , where \ \Gj kt J| p <C Pi7 
log 3 / 4 (iV)||/ /fc . J| 2 and ||£ Jfc , J| 2 « p , 7 i^( 2 -P)/ 2 log 7 / 4 (iV)||/ 4 , J| 2 where if = f tt^-t^. For 



14*,, I 



k* 



I k *, s € we have |I fcv | < 2- fe */ 2 iV, so K > 2 4 (f- 2 ) . 
Setting p = 5/2, we now have 



i/p 



E i<W 

a<s<2 fc * 



< 



i/p 



E i G vj p 

a<s<2 fe * 



2/5 



E ikv.i$ 



,Ks<2*' 



2/5 



«log 3 / 4 (iV)[ E H/V. 

,l<s<2 fc * 



1 5/2 



Now Ei< s < 2 fc* H/4*,Jl2 /2 < 2fc * (2~ fc */2) 5 / 2 < log- 15 / 4 (A^), which implies the quantity is 

<log 3 / 4 (iV)log- 3/2 (A0 < 1. 

We last consider 



E I*W 



. l<s<2 fe 



/ 



< 



\ 



/ £ "V 



l<s<2 fe 



£ log 7 / 2 (iV)K 2 ^||/ /fc ,, 



l<s<2 fe 

V fc * ,8 eJjJ. 



/ 



We recall that K 



N \ 2(p-2) 



fc* 



,3-p 



7 — , , so for I fc%s E /£„ we have A" > 2 4 (f- 2 ) . Thus, K~ < 2" r, 
so the quantity above is <C log 7//4 (iV) log _15/,8 (iV) <C 1. This completes the proof. □ 
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6 Improving the Bound by Passing to Random Subsets 
6.1 Bounding the Expectation 

For a fixed interval I C [N] and a bounded ONS {(f>i}iLii we define the operator Uj from l 2 N 
to the Banach space of real valued functions on T with norm || • ||r K °v mapping a sequence 
{a.i}f =1 to the function Yliti a i4>i- 

Lemma 26. Let {4>i}f =l be a bounded ONS. Let L be a fixed interval satisfying 



sup 

IIR}ll f 2<l 



iei 



< A 



(21) 



for a fixed A and K. Let be independent selectors with mean v . We let S C I denote 

the set of indices i £ L such that £j = 1. Then 



E 



sup 

ll{oi}||,a<l 



ies 



A 2 + C p log N 



log (i) 



Proof. We apply Lemma [TOl to the operator Ui. This yields 



E 



sup 

IIK}ll f 2<l 



ies 



rv 



C p log iV + sup 

IIK}|L 2 <i 



y~]ai(/>i 



Using the interval I satisfies (|2ip . we see this quantity is <C 



A 2 +C p logjV 

Mi) ' 



□ 



6.2 Bounding the Concentration 

We now consider the concentration of the random variable su P||{a i }|| < , 2 <i I l^ieS a «^| lr aroun d 
its expectation. 

Lemma 27. Let {(f>i}fLi be a bounded ONS. Fix < /3 < 1 and let L < N, 5 := L/N. Then 
for any 7 > 1 it holds with probability at least 1 — cN~^ (for some universal c) that a uniformly 
randomly selected subset S of [N] of size L satisfies 



sup 

IIK}II £ 2<1 



ies 



log 3 / 4 (AQ 



(22) 



for K := min j Q) 1/(4p 8 \log^(iV)} and 2 < p < 3. 

Proof. We will pass to a set of size L in two stages. We first define an intermediate value 



Li > L such that ^ = ^. We will set if := min j (^J "" ' , log 2 *- 4 (iV)j>. We then apply 

Corollary [23 to conclude that with sufficiently high probability, say at least 1 — cN~ (~/+ 2 ) , a 
random subset W of size Li inside [N] will satisfy 



xl/(2p-2) 
£L\ ncr2 P -4| 



sup 

IIK}|L 2 <i 



logN 



VlogLi. 



(23) 
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We now condition on the above result, and consider choosing a random subset S C W of 
size L. We first note that for any such set S, 



ies 



sup (y\ai4>i,g), 



where || • ||r* denotes the dual norm to || • ||r A -- Using the Cauchy-Schwarz inequality, we then 
see that 



sup 

ll{«*}||/a<l 



sup sup y~]ai{4>i,g) < sup V|(</>i,s) 
IIK}||,2<l|| 9 ||r.<l~^ ll<?llr*<l \~ts 



(24) 



(<t>i9) 



2"' 



we see that the inequality in (|24p is an 



In fact, noting that one can set ai 
equality. 

We let C be a threshold parameter that we will specify later. We define the functions 
Xi, X2 '■ - > R as follows: 



x, if \x\ < C; 
0, otherwise 



X2(x) 



0, if |x| < C; 
x, otherwise 



We then have 



sup T\(&,g)\ 2 < sup Vxi (\(&,g)\ 2 ) + sup Vx 2 (|(^»| 2 ). (25) 
IMIr*<l~^ IMIr*<l^ V ' [Wlr'<l^s V ' 

We will deal separately with the two quantities in (|25p . To address the first quantity, we will 
employ Lemma [TTJ In this case, our random variables Yi, . . . , Yjv are defined as follows. We let 
fi denote the probability space of the independent selectors each with mean v := j^. 

Then each uj £ £1 is associated to a subset S Q I. (This distribution of S differs of course from 
selecting a set of size exactly L, but we will analyze the relevant quantity in this case first and 
then derive a bound for the case of fixed size.) We define Yiito) to be equal to <j>i when i G S, 
and equal to otherwise (more formally, the constant zero function from T to C). 

For notational convenience, we defined the random variable Z : — > R by 



Z{u)= sup Vxi (\((t>i,g}\ 2 ) , 



where S is determined from cj as described above. Applying Lemma [TTT it follows that (for 
every positive real number r) 



\Z -E\Z]\> r ]£:;,xp | -J log (l+ «1 



(26) 



where ft is a positive constant, and <r 2 = sup| | 5 | | r , <i SieS X 1 ( 1(^)5)1' 
Recalling our definition of Z, we observe 



Z{J)< sup V" K^,^) | 2 = sup 

llflllr*<i i e s Il{^}|li2<i 



ies 
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(We have used here that (|24p is actually an equality.) Consequently, 

2 

E[Z] < E ' 



sup 

IIK}|L?2<1 



Applying Lemma l26| we have 



E\Z] < 



A 2 + C p logiV 

log (a ' 



where with sufficient probability we have A <C Pi7 log 4 (TV) log 4 j y/\ogL\ from ([23]) . Thus, 
there exist a constant such that 



Z > t + A x 



X 2 + C p logN 
log (i) 



< 3 exp — — log 1 + 



kC 



Ct 
Ek 2 ] 



(27) 



It remains to bound the quantity 

E[cr 2 ] = E 

Employing line (7.10) on p. 141 of [8], we see this is 



sup y) xi (i(0t>fl)r 

9\\r*<l i&S ] V 



« sup Ve[(xi <7>| 2 )) 5 



+ CE 



I9llr*<l i6/ 

By removing the cutoff at C, the latter quantity in ([28]) is 



sup XI (I f 

s||r*<l ieS 



(28) 



< CE 



sup y^K0i,5)i' 

l9llr*<l ieS 



CE 



sup 

ll{oi}[[<a<l 



y 



< C 



A 2 + C p logiV 

log m . 



where we have recalled that (|24p is an equality and have applied Lemma [ 

We now consider the first quantity in (|28p . Using the definition of xi an d Lemma [3 we see 
it is 

«C sup ]Te[|<^, 9 )| 2 ] «Cz. sup IMI 2 «CV J FC 2 f- 2 . 



lr*<l ie7 



lflllr*<l 

Putting this together we have, for universal constants A% and A2, that 

A 2 + C p logiv" 



Z > t + A 



1 • 



logQ) 



< 3 exp 



V 



1 + 



Cr 



\ 



V 



^C^ 2 K2p-2 + A 2 C ( X2 l C g P { ^ N J J 



\ 



(29) 



We now set t = A\ - — f^-f — • Since 

K 2 P -2 

< — , we see that the quantity inside the logarithm 
in ([29]) above is at least 1 + err, where a is a positive constant depending on p, 7. Since v > jj, 
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it is clear that r is bounded away from 0. Thus, it suffices to take C = ^4 7iP log~ 1 (A r ) (for some 
constant -A 7iP dependent only on p and 7). 

Using Lemma [T3"] we can derive an analogous upper bound on the probability of the event 
that 

sup Vxi(l<^,3>| 2 ) >2r 
Il9llr*<l~^ V ' 

for a randomly chosen set S of size L (inside W). The extra factor of < can be easily 
accommodated by choosing C to be a sufficiently high power of log N. 
We note that 

_ A 2 + C p logiV log 3/2 (iV) log AT 

1- M£) m iog 3/2 (^) M£) 

whenever ([23]) holds. Thus, the square root of this quantity is <C P , 7 r^u j w\ as required. 

We return to consider the second quantity in (|25|) . By Bessel's inequality, the number of 
nonzero terms appearing in the sum is at most C~ supiijji <j ||/i|| 2 2- From Lemma [7] we see 
that this is C~ 1 p 2 K 2p ~ 2 . 

We then have 

sup Vx2 ( \{(pi,h)\ 2 ) < sup sup V|((^,/i)| 2 . (30) 

INIr*<l^ V ' |[fc|lr«<l ^ 

" |/|<c-V^ 2p " 2 

We next observe that 

sup sup V" |(</>i, /i)| 2 < sup sup I Oj((pi,h) I . 

I|h||r*<l IIK}||, 2 <1 ||^|| r *<l / 

|/|<C-VAT 2p - 2 ISupportaaJJI^-V^- 2 

(31) 

To see this, note that on can set ai := _ f or i ' m a set I of size at most 

(E j6 /K<^>l 2 ) 5 

Combining ([30]) and ([31~]) . we see that 



sup yZx2 (\{<pi,h)\ 2 ) < sup sup [y2ai((f)i,h)) 

^llr*<l ie5 V ' \\{o.i)\\fl<\ IHIr*<l VieVK / 



IWIr*<l 

|Support({ai})|<C-VK 2 P- 2 



Employing Lemma [5] this is 



"C sup 

IIK}||,2<1 
|Support({a;})|<C-V^ 2p ~ 2 



jew 



By (our earlier application of) Corollary 123] we see this is 

1 1 

2 / \ 2 
loeiV \ , . 1 o T ,o„_o, / logiV 



logff I log'" 



«*w I ~ — 777Y I iog(c-V 2 ^ 2p - 2 ) = ( ] log(A^i log(iV)p 2 ^ 2p - 

loeiV \ 2 

«^ I / w \ loglog(iV). (32) 
Clearly this term is acceptable and completes the proof. □ 
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6.3 Deriving the Main Theorem 



Using Lemma [27] with Corollary [T7J allows us to take r = 3/4 and p = 3 — e' in Lemma [19] 
for any < e' < 1. More precisely, we can set := i/2+3/4-l/(3-e') = 5 _ 4 ^ 3 _ e /- ) ■ To obtain 
the hypotheses needed to apply Lemma UM we employ Corollary 1 171 To obtain the hypotheses 
needed to apply this corollary, we employ Lemma [271 with 5 := 2 _21og ( N \ Note here that K 
will be set so that log r+1 \N)K 2 ~ p / 2 <C 1 as required. We also employ Proposition [52] Then, 
with e defined by ^ + e = io_g/ 3 (3_ e ') > we obtain: 

Theorem 1. Let <!> := {(p n }n=i denote a bounded ONS. Let e > 0,7 > 1. Then, with probability 
at least 1 — cN~^ (for some universal c), for a uniformly random permutation a : [N] —> [N], 
the system {(p a (n)}^ =1 will satisfy 

||V 2 /||2«e, 7 logA +e (iV)||/|| 2 . 

7 Concluding Remarks 

We remarked earlier that when proving the main probabilistic estimate, Lemma [271 we were 
unable to work exclusively in either Bourgain or Talagrand's frameworks, and had to use a 
hybrid of the two. We briefly expound on these issues and how our approach addresses them. 

Bourgain's approach to the A(p)-problem makes very careful use of the structure of the LP 
norms, in particular relying on delicate pointwise inequalities involving the function x — > \x\ p . 
We have been unable to find appropriate analogs of these arguments in the case of the more 
awkward Tk norms of relevance here. Many of these issues can be avoided by first pigeonholing 
the coefficients to a single level set (indeed even Bourgain's paper [2] can be substantially 
simplified in this case, see for instance the argument sketched in [3]) however this introduces a 
logarithmic factor that is fatal for the current application (although, this is essentially the idea 
behind the estimates in Section 4.1, and the prior work in the maximal and V r cases). 

In contrast, Talagrand's methods can be applied to a very general class of norms, including 
those considered here (as we have partly done via Lemma [TU|) . However, when used in the 
framework of |16| (such as Theorem 1.2 there), one 'loses' a factor of N~ € in the density of the 
resulting subsets. In the case of L p norms and polynomially sparse sets (of relevance in the 
A(p)-problem), Talagrand is able to decompose the norm into two parts. On one part, better 
estimates are true for denser sets (and thus the loss of a N~ e factor is acceptable) and his 
very general theorem can be applied. The other part can be handled with alternate (and more 
elementary) methods. Again, we have been unable to find an analog of these arguments in the 
setting of Tk norms and on the denser sets of relevance here. 

Our approach has been to use Bourgain's arguments to prove that the Tx norm on a 
random subset of the relevant density is within a logarithmic factor of what is needed (we note 
that the arguments from section 3 of [16] are too crude for the purposes here). We then use the 
orthonormal system associated to this set as the starting point for the application of Talagrand's 
methods, which are invoked by passing to a slightly sparser random subset. 

This approach gives satisfactory control over the expectation of the norm on the resulting 
subset, however our application requires more information about the concentration around the 
expectation. Lemma [TT1 provides a useful estimate in this respect, however this alone does not 
seem sufficient for our application. Fortunately, this estimate is insufficient only when some of 
the coefficients are large and we are able to adequately control the contribution from terms with 
large coefficients by returning to Bourgain's argument with pigeonholing, since the number of 
relevant level sets is then reduced. Likely, a better understanding/treatment of these issues will 
lead to an improved estimate. 
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Some final remarks: 

Remark 28. As with many orthonormal system results (for instance those of J||/ and our 
arguments can be easily modified to treat the more general case of Hilbertian systems. 

Remark 29. A quantitative form of Kolmogorov's rearrangement theorem due to Nakata 
states that the first N exponentials can be reordered such that there exists an f in their span such 
that \\Mf\\i,2 S> log 1//4 (A^)||/||^2. From modulation and dilation symmetries, this holds for any 
set of N exponentials associated to an arithmetic progression. Notice that a random reordering 
of [N] will contain an increasing arithmetic progression of length at least S> log c (iV) (for some 
absolute constant c > 0) in this 'bad' ordering. Thus while a random reordering decreases the 
norm of the V 2 operator, it increases the norm of the M. operator to (at least) (log log (iV)) 1 / 4 , 
with large probability. 

Remark 30. In a related paper fii]/ . we have shown that given an arbitrary ONS of length N , 
one may find an alternate ONS that spans the same space such that \\V 2 f\\^2 <C y'log log (N) 1 1 / 1 1 ip. , 
which is sharp. Where the results in our current work rely on estimates for what may be called 
selector processes, there the problem can be reduced to estimates for Gaussian processes. Gen- 
erally, estimates in that setting are stronger and better understood. 
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